Preprocessing of Web Usage Data for Application in Prefetching to Reduce Web Latency

نویسنده

  • G T Raju
چکیده

-The Popularity of Web resulted in heavy traffic in the Internet. This intense increase in internet traffic has caused significant increase in the user perceived latency. In order to reduce this effectively. Prefetching techniques are found to be best suitable. Prefetching technique is motivated by the fact that, in general, once a user goes to a Web site; he/she generally browses around for several pages before leaving for another site. Since the user follows hyperlinks upon his/her interests, it is likely that links are not followed uniformly. It is possible to either predict each user’s interest using cookies or mine a consensus of interests (i.e., generally what pages will be requested after the current page) with some confidence from access log files recorded by the Web server. This information not only is valuable for the Web administrator to eliminate uninterested pages, or balance load among the servers, but also can help to improve Web-browsing time. In this paper, we propose a comprehensive preprocessing methodology as a prerequisite and first stage for Prefetching application, which has four steps: Data Cleaning, Identification of users & Sessions, and finally the Data Formatting and Summarization. An attempt is made to reduce the quantity of the WUD and thereby improve the quality of WUD for effective use in Prefetching application. Several heuristics have been proposed for cleaning the WUD which is then aggregated and recorded in the relational data model. To validate the efficiency of our preprocessing methodology, several experiments were conducted log files on three different web sites: Academic, Research, Commercial and the results shows that our methodology reduces the Web access log files size down to 72-83% of the initial size and offer richer logs that are structured for application in Prefetching. Index Term-Preprocessing, Prefetching, Web Usage Data,

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Long-term Web Prefetching Algorithms: A Comparative Study

User perceived latency has become a potential problem due to the increase in internet traffic. Web caching is an effective means of reducing user perceived latency. Web prefetching is an attractive solution which relies on web caching to reduce access latency. There are two kinds of algorithms that are currently used for prefetching i.e., linear algorithms and data mining algorithms. Web prefet...

متن کامل

A Survey on Preprocessing Methods for Web Usage Data

World Wide Web is a huge repository of web pages and links. It provides abundance of information for the Internet users. The growth of web is tremendous as approximately one million pages are added daily. Users’ accesses are recorded in web logs. Because of the tremendous usage of web, the web log files are growing at a faster rate and the size is becoming huge. Web data mining is the applicati...

متن کامل

Fuzzy Equivalent Matrix for Discovering Patterns of Web Users Navigation

-World Wide Web provides abundance of information for the Internet users and is a huge repository of web pages and links. The growth of web is tremendous as approximately one million pages are added daily. Web logs record users’ accesses. Because of the tremendous usage of web , the web log files are growing at a faster rate and the size is becoming huge. Web data mining is the application of d...

متن کامل

Web Prefetching with High Accuracy and Low Memory Cost

Prefetching algorithms can effectively reduce Web latency and dramatically improve responsiveness of interactive Web application. We propose a new history-based prefetching algorithm that achieves very high prediction accuracy, generates little overhead traffic, and allows users to bound the amount of memory that it uses. We also propose a method to find accurate upper bounds on the performance...

متن کامل

Web Prefetching: Costs, Benefits and Performance

Due to the fast development of internet services and a huge amount of network traffic, it is becoming an essential issue to reduce World Wide Web user-perceived latency. Although web performance is improved by caching, the benefit of caches is limited. To further reduce the retrieval latency, web prefetching becomes an attractive solution to this problem. Prefetching reduces user access time, b...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014